智能论文笔记

Estimation of Acetabular Version from Anteroposterior Pelvic Radiograph Employing Deep Learning

Ata Jodeiri , Hadi Seyedarabi , Fatemeh Shahbazi , Seyed Mohammad Mahdi Hashemi , Seyyedhossein Shafiei

分类：人工智能 | 计算机视觉 | 机器学习

2021-11-14

背景和目的：髋臼版本，总髋关节置换术的必要因素，通过CT扫描作为金标准测量。 CT的辐射剂量和耗材使前后骨盆射线照相适当的替代程序。在这项研究中，我们在前后骨盆X射线上应用了深度学习方法来测量解剖学版本，消除了使用计算机断层扫描的必要性。方法：使用CT图像计算300名患者臀部的右侧和左侧髋臼版角。所提出的深度学习模型，对骨骼时代的预用-VGG16的注意力应用于包括群体的AP图像。这些人的年龄和性别被添加到最后一个完全连接的注意机制层的其他投入。作为输出，预测两个臀部的角度。结果：随着人们更老的，在CT上增加了CT的髋骨角度，男性的平均值为16.54和16.11（右侧和左角度）。使用所提出的深度学习方法估计右侧角度估计的预测错误位于精确的误差区域（<= 3度），显示了所提出的方法在基于AP图像测量解剖学版本的能力。结论：建议算法，在患者骨盆的AP图像上应用预训练的VGG16，其次是考虑年龄和患者性别的注意力模型，可以仅使用AP射线照片准确评估版本，同时避免CT扫描的需要。基于AP盆景的解剖学髋臼版本的应用技术，使用DL方法，迄今为止尚未发布。

translated by 谷歌翻译

Graph Federated Learning for CIoT Devices in Smart Home Applications

Arash Rasti-Meymandi , Seyed Mohammad Sheikholeslami , Jamshid Abouei , Konstantinos N. Plataniotis

分类：机器学习

2022-12-29

This paper deals with the problem of statistical and system heterogeneity in a cross-silo Federated Learning (FL) framework where there exist a limited number of Consumer Internet of Things (CIoT) devices in a smart building. We propose a novel Graph Signal Processing (GSP)-inspired aggregation rule based on graph filtering dubbed ``G-Fedfilt''. The proposed aggregator enables a structured flow of information based on the graph's topology. This behavior allows capturing the interconnection of CIoT devices and training domain-specific models. The embedded graph filter is equipped with a tunable parameter which enables a continuous trade-off between domain-agnostic and domain-specific FL. In the case of domain-agnostic, it forces G-Fedfilt to act similar to the conventional Federated Averaging (FedAvg) aggregation rule. The proposed G-Fedfilt also enables an intrinsic smooth clustering based on the graph connectivity without explicitly specified which further boosts the personalization of the models in the framework. In addition, the proposed scheme enjoys a communication-efficient time-scheduling to alleviate the system heterogeneity. This is accomplished by adaptively adjusting the amount of training data samples and sparsity of the models' gradients to reduce communication desynchronization and latency. Simulation results show that the proposed G-Fedfilt achieves up to $3.99\% $ better classification accuracy than the conventional FedAvg when concerning model personalization on the statistically heterogeneous local datasets, while it is capable of yielding up to $2.41\%$ higher accuracy than FedAvg in the case of testing the generalization of the models.

translated by 谷歌翻译

Weakly-Supervised Deep Learning Model for Prostate Cancer Diagnosis and Gleason Grading of Histopathology Images

Mohammad Mahdi Behzadi , Mohammad Madani , Hanzhang Wang , Jun Bai , Ankit Bhardwaj , Anna Tarakanova , Harold Yamase , Ga Hie Nam , Sheida Nabavi

分类：计算机视觉

2022-12-25

Prostate cancer is the most common cancer in men worldwide and the second leading cause of cancer death in the United States. One of the prognostic features in prostate cancer is the Gleason grading of histopathology images. The Gleason grade is assigned based on tumor architecture on Hematoxylin and Eosin (H&E) stained whole slide images (WSI) by the pathologists. This process is time-consuming and has known interobserver variability. In the past few years, deep learning algorithms have been used to analyze histopathology images, delivering promising results for grading prostate cancer. However, most of the algorithms rely on the fully annotated datasets which are expensive to generate. In this work, we proposed a novel weakly-supervised algorithm to classify prostate cancer grades. The proposed algorithm consists of three steps: (1) extracting discriminative areas in a histopathology image by employing the Multiple Instance Learning (MIL) algorithm based on Transformers, (2) representing the image by constructing a graph using the discriminative patches, and (3) classifying the image into its Gleason grades by developing a Graph Convolutional Neural Network (GCN) based on the gated attention mechanism. We evaluated our algorithm using publicly available datasets, including TCGAPRAD, PANDA, and Gleason 2019 challenge datasets. We also cross validated the algorithm on an independent dataset. Results show that the proposed model achieved state-of-the-art performance in the Gleason grading task in terms of accuracy, F1 score, and cohen-kappa. The code is available at https://github.com/NabaviLab/Prostate-Cancer.

translated by 谷歌翻译

Multi-Server Over-the-Air Federated Learning

Seyed Mohammad Azimi-Abarghouyi , Viktoria Fodor

分类：人工智能 | 机器学习

2022-11-29

In this work, we propose a communication-efficient two-layer federated learning algorithm for distributed setups including a core server and multiple edge servers with clusters of devices. Assuming different learning tasks, clusters with a same task collaborate. To implement the algorithm over wireless links, we propose a scalable clustered over-the-air aggregation scheme for the uplink with a bandwidth-limited broadcast scheme for the downlink that requires only two single resource blocks for each algorithm iteration, independent of the number of edge servers and devices. This setup is faced with interference of devices in the uplink and interference of edge servers in the downlink that are to be modeled rigorously. We first develop a spatial model for the setup by modeling devices as a Poisson cluster process over the edge servers and quantify uplink and downlink error terms due to the interference. Accordingly, we present a comprehensive mathematical approach to derive the convergence bound for the proposed algorithm including any number of collaborating clusters in the setup and provide important special cases and design remarks. Finally, we show that despite the interference in the proposed uplink and downlink schemes, the proposed algorithm achieves high learning accuracy for a variety of parameters.

translated by 谷歌翻译

Supervised Class-pairwise NMF for Data Representation and Classification

Rachid Hedjam , Abdelhamid Abdesselam , Seyed Mohammad Jafar Jalali , Imran Khan , Samir Brahim Belhaouari

分类：机器学习

2022-09-28

基于各种非负矩阵分解（NMF）方法为成本函数添加了新术语，以使模型适应特定任务，例如聚类或保留减少空间中的某些结构属性（例如，局部不变性）。附加的术语主要由高参数加权，以控制整体公式的平衡，以指导优化过程实现目标。结果是一种参数化的NMF方法。但是，NMF方法采用了无监督的方法来估计分解矩阵。因此，不能保证使用新的特征执行预测（例如分类）的能力。这项工作的目的是设计一个进化框架，以学习参数化NMF的超参数，并以监督的方式估算分解矩阵，以更适合分类问题。此外，我们声称，将基于NMF的算法分别应用于不同的类对，而不是将其应用于整个数据集，从而提高了矩阵分解过程的有效性。这导致训练具有不同平衡参数值的多个参数化的NMF算法。采用了交叉验证组合学习框架，并使用遗传算法来识别最佳参数值集。我们对真实和合成数据集进行的实验证明了所提出的方法的有效性。

translated by 谷歌翻译

STPOTR: Simultaneous Human Trajectory and Pose Prediction Using a Non-Autoregressive Transformer for Robot Following Ahead

Mohammad Mahdavian , Payam Nikdel , Mahdi TaherAhmadi , Mo Chen

分类：机器人 | 机器学习

2022-09-15

在本文中，我们开发了一个神经网络模型，以从观察到的人类运动历史中预测未来的人类运动。我们提出了一种非自动回归的变压器体系结构，以利用其平行性质，以便在测试时更容易训练和快速，准确的预测。所提出的结构将人类运动预测分为两个部分：1）人类轨迹，即随着时间的推移，髋关节3D位置和2）人类姿势，这是所有其他关节3D位置，相对于固定的髋关节。我们建议同时做出两个预测，因为共享表示可以改善模型性能。因此，该模型由两组编码器和解码器组成。首先，应用于编码器输出的多头注意模块改善了人类轨迹。其次，应用于与解码器输出相连的编码器输出的另一个多头自发项模块有助于学习时间依赖性。我们的模型非常适合于测试准确性和速度方面的机器人应用，并且相对于最新方法比较。我们通过机器人跟踪任务证明了我们作品的现实适用性，这是我们提议的模型充满挑战而实用的案例研究。

translated by 谷歌翻译

IR-LPR: Large Scale of Iranian License Plate Recognition Dataset

Mahdi Rahmani , Melika Sabaghian , Seyyede Mahila Moghadami , Mohammad Mohsen Talaie , Mahdi Naghibi , Mohammad Ali Keyvanrad

分类：计算机视觉

2022-09-10

对象检测一直是实用的。我们世界上有很多事情，以至于认识到它们不仅可以增加我们对周围环境的自动知识，而且对于有兴趣开展新业务的人来说也可以很有利润。这些有吸引力的物体之一是车牌（LP）。除了可以使用车牌检测的安全用途外，它还可以用于创建创意业务。随着基于深度学习模型的对象检测方法的开发，适当且全面的数据集变得双重重要。但是，由于频繁使用车牌数据集的商业使用，不仅在伊朗而且在世界范围内也有限。用于检测车牌的最大伊朗数据集具有1,466张图像。此外，识别车牌角色的最大伊朗数据集具有5,000张图像。我们已经准备了一个完整的数据集，其中包括20,967辆汽车图像，以及对整个车牌及其字符的所有检测注释，这对于各种目的都是有用的。此外，字符识别应用程序的车牌图像总数为27,745张图像。

translated by 谷歌翻译

Elderly Fall Detection Using CCTV Cameras under Partial Occlusion of the Subjects Body

Sara Khalili , Hoda Mohammadzade , Mohammad Mahdi Ahmadi

分类：计算机视觉

2022-08-15

老年人在日常生活中面临的可能危险之一就是下降。闭塞是基于视觉的秋季检测系统的最大挑战之一，并大大降低了其检测性能。为了解决这个问题，我们合成了专门设计的遮挡视频，用于使用现有数据集训练秋季检测系统。然后，通过定义新的成本功能，我们引入了一个使用遮挡和未封闭式视频的秋季检测模型加权训练的框架，该视频可应用于任何可学习的秋季检测系统。最后，我们同时使用非深度和深层模型来评估所提出的加权训练方法的效果。实验表明，在遮挡条件下，提出的方法可以将分类精度提高36％，而对于深层模型，则可以将分类精度提高36％。此外，结果表明，所提出的训练框架还可以显着提高正常未封闭样品的深网的检测性能。

translated by 谷歌翻译

Inference of Regulatory Networks Through Temporally Sparse Data

Mohammad Alali , Mahdi Imani

分类：机器学习 | (统计)机器学习

2022-07-21

基因组学的主要目标是正确捕获基因调节网络（GRN）的复杂动力学行为。这包括推断基因之间的复杂相互作用，这些基因可用于广泛的基因组学分析，包括诊断或预后疾病以及为诸如癌症等慢性疾病的有效治疗方法。布尔网络已成为成功捕获GRN行为的成功类型。在大多数实用的环境中，应通过有限且时间稀疏的基因组数据来实现GRN的推论。 GRN中的大量基因会导致大量可能的拓扑候选空间，由于计算资源的限制，通常无法详尽地搜索。本文使用贝叶斯优化和基于内核的方法为GRN提供了可扩展有效的拓扑推断。所提出的方法不是对可能的拓扑结构进行详尽的搜索，而是构建具有拓扑启发的内核函数的高斯过程（GP），以说明可能性函数的相关性。然后，使用GP模型的后验分布，贝叶斯优化有效地搜索具有最高可能性值的拓扑，通过在勘探和剥削之间进行最佳平衡。通过使用众所周知的哺乳动物细胞周期网络的综合数值实验来证明所提出的方法的性能。

translated by 谷歌翻译

Learning-based Monocular 3D Reconstruction of Birds: A Contemporary Survey

Seyed Mojtaba Marvasti-Zadeh , Mohammad N. S. Jahromi , Javad Khaghanix , Devin Goodsman , Nilanjan Ray , Nadir Erbilgin

分类：计算机视觉

2022-07-10

在自然界中，动物的集体行为（例如飞鸟）由同一物种的个体之间的相互作用主导。但是，对鸟类物种中这种行为的研究是一个复杂的过程，即人类无法使用常规的视觉观察技术（例如自然界的焦点采样）进行。对于鸟类等社会动物，群体形成的机制可以帮助生态学家了解社交线索及其视觉特征随着时间的流逝（例如姿势和形状）之间的关系。但是，恢复飞行鸟类的不同姿势和形状是一个极具挑战性的问题。解决此瓶颈的一种广泛的解决方案是将姿势和形状从2D图像提取到3D对应关系。 3D视觉的最新进展导致了关于3D形状和姿势估计的许多令人印象深刻的作品，每项作品都有不同的利弊。据我们所知，这项工作是首次尝试概述基于单眼视觉的3D鸟重建的最新进展，使计算机视觉和生物学研究人员概述了现有方法，并比较其特征。

translated by 谷歌翻译